Page 1 of 9 





PCT 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/538,471 



DATE: 06/22/2005 
TIME: 10:19:58 



Input Set : A:\PTO.RJ.txt 

Output Set: N:\CRP4\06222005\J538471.raw 

3 <110> APPLICANT: Balakireva, Larissa 

5 <120> TITLE OF INVENTION: MOLECULES INHIBITING HEPATITIS C VIRUS PROTEIN SYNTHESIS AND
METHOD FOR SCREENING SAME
<130> FILE REFERENCE: 1759.200 

<140> CURRENT APPLICATION NUMBER: US/10/538,471 
<141> CURRENT FILING DATE: 2005-06-03 

<150> PRIOR APPLICATION NUMBER: PCT/FR03/03675 
<151> PRIOR FILING DATE: 2003-12-11 
<150> PRIOR APPLICATION NUMBER: FR0215718 
<151> PRIOR FILING DATE: 2002-12-12 
<160> NUMBER OF SEQ ID NOS : 16 
<170> SOFTWARE: PatentIn version 3.1 
<210> SEQ ID NO: 1
<211> LENGTH: 326
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE: <221> HCV
<222> LOCATION: [illegible]
<223> OTHER INFORMATION: corresponds to IRES sequence of HCV
<400> SEQUENCE: 1

ctcccctgtg aagaactact gtcttcacgc agaaagcgtc tagccatggc gttagtatga 60
gtgtcgtgca gcctccagga ccccccctcc cgggagagcc atagtggtct gcggaaccgg 120
tgagtacacc ggaattgcca ggatgaccgg gtcctttctt ggatcaaccc gctcaatgcc 180
tggagatttg ggcgtgcccc cgcgagactg ctagccgagt agtgttgggt cgcgaaaggc 240
cttgtggtac tgcctgatag ggtgcttgcg agtgccccgg gaggtctcgt agaccgtgca 300
tcatgagcac aaatcctaaa gaaaaa 326
<210> SEQ ID NO: 2
<211> LENGTH: 80
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE: <221> HCV
<222> LOCATION: [illegible]
<223> OTHER INFORMATION: corresponds to a portion (region II) of HCV IRES
<400> SEQUENCE: 2

ctcccctgtg aggaactact gtcttcacgc agaaagcgtc tagccatggc gttagtatga 60
gtgttgtgca gcctccagga 80
<210> SEQ ID NO: 3
<211> LENGTH: 37
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE: <221> HCV
<222> LOCATION:
<223> OTHER INFORMATION: corresponds to a portion (consensus sequence) of HCV IRES
<400> SEQUENCE: 3

[illegible] 37

75 <210> SEQ ID NO: 4
76 <211> LENGTH: 814
77 <212> TYPE: PRT
78 <213> ORGANISM: [illegible]
W--> 80 <220> FEATURE: [illegible]
81 <222> LOCATION: [illegible]
82 <223> OTHER INFORMATION: corresponds to p116 subunit of eIF3
84 <400> SEQUENCE: 4

86 Met Gln Asp Ala Glu Asn Val Ala Val Pro Glu Ala Ala Glu Glu Arg
87 1 5 10 15

90 Ala Glu Pro Gly Gln Gln Gln Pro Ala Ala Glu Pro Pro Pro Ala Glu
91 20 25 30

94 Gly Leu Leu Arg Pro Ala Gly Pro Gly Ala Pro Glu Ala Ala Gly Thr
95 35 40 45

98 Glu Ala Ser Ser Glu Glu Val Gly Ile Ala Glu Ala Gly Pro Glu Pro
99 50 55 60

102 Glu Val Arg Thr Glu Pro Ala Ala Glu Ala Glu Ala Ala Ser Gly Pro
103 65 70 75 80

106 Ser Glu Ser Pro Ser Pro Pro Ala Ala Glu Glu Leu Pro Gly Ser His
107 85 90 95

110 Ala Glu Pro Pro Val Pro Ala Gln Gly Glu Ala Pro Gly Glu Gln Ala
111 100 105 110

114 Arg Asp Glu Arg Ser Asp Ser Arg Ala Gln Ala Val Ser Glu Asp Ala
115 115 120 125

118 Gly Gly Asn Glu Gly Arg Ala Ala Glu Ala Glu Pro Arg Ala Leu Glu
119 130 135 140

122 Asn Gly Asp Ala Asp Glu Pro Ser Phe Ser Asp Pro Glu Asp Phe Val
123 145 150 155 160

126 Asp Asp Val Ser Glu Glu Glu Leu Leu Gly Asp Val Leu Lys Asp Arg
127 165 170 175

130 Pro Gln Glu Ala Asp Gly Ile Asp Ser Val Ile Val Val Asp Asn Val
131 180 185 190

134 Pro Gln Val Gly Pro Asp Arg Leu Glu Lys Leu Lys Asn Val Ile His
135 195 200 205

138 Lys Ile Phe Ser Lys Phe Gly Lys Ile Thr Asn Asp Phe Tyr Pro Glu
139 210 215 220

142 Glu Asp Gly Lys Thr Lys Gly Tyr Ile Phe Leu Glu Tyr Ala Ser Pro
143 225 230 235 240

146 Ala His Ala Val Asp Ala Val Lys Asn Ala Asp Gly Tyr Lys Leu Asp
147 245 250 255

150 Lys Gln His Thr Phe Arg Val Asn Leu Phe Thr Asp Phe Asp Lys Tyr
151 260 265 270

154 Met Thr Ile Ser Asp Glu Trp Asp Ile Pro Glu Lys Gln Pro Phe Lys
155 275 280 285

158 Asp Leu Gly Asn Leu Arg Tyr Trp Leu Glu Glu Ala Glu Cys Arg Asp
159 290 295 300

162 Gln Tyr Ser Val Ile Phe Glu Ser Gly Asp Arg Thr Ser Ile Phe Trp
163 305 310 315 320

166 Asn Asp Val Lys Asp Pro Val Ser Ile Glu Glu Arg Ala Arg Trp Thr
167 325 330 335

170 Glu Thr Tyr Val Arg Trp Ser Pro Lys Gly Thr Tyr Leu Ala Thr Phe
171 340 345 350

174 His Gln Arg Gly Ile Ala Leu Trp Gly Gly Glu Lys Phe Lys Gln Ile
175 355 360 365

178 Gln Arg Phe Ser His Gln Gly Val Gln Leu Ile Asp Phe Ser Pro Cys
179 370 375 380

182 Glu Arg Tyr Leu Val Thr Phe Ser Pro Leu Met Asp Thr Gln Asp Asp
183 385 390 395 400

186 Pro Gln Ala Ile Ile Ile Trp Asp Ile Leu Thr Gly His Lys Lys Arg
187 405 410 415

190 Gly Phe His Cys Glu Ser Ser Ala His Trp Pro Ile Phe Lys Trp Ser
191 420 425 430

194 His Asp Gly Lys Phe Phe Ala Arg Met Thr Leu Asp Thr Leu Ser Ile
195 435 440 445

198 Tyr Glu Thr Pro Ser Met Gly Leu Leu Asp Lys Lys Ser Leu Lys Ile
199 450 455 460

202 Ser Gly Ile Lys Asp Phe Ser Trp Ser Pro Gly Gly Asn Ile Ile Ala
203 465 470 475 480

206 Phe Trp Val Pro Glu Asp Lys Asp Ile Pro Ala Arg Val Thr Leu Met
207 485 490 495

210 Gln Leu Pro Thr Arg Gln Glu Ile Arg Val Arg Asn Leu Phe Asn Val
211 500 505 510

214 Val Asp Cys Lys Leu His Trp Gln Lys Asn Gly Asp Tyr Leu Cys Val
215 515 520 525

218 Lys Val Asp Arg Thr Pro Lys Gly Thr Gln Gly Val Val Thr Asn Phe
219 530 535 540

222 Glu Ile Phe Arg Met Arg Glu Lys Gln Val Pro Val Asp Val Val Glu
223 545 550 555 560

226 Met Lys Glu Thr Ile Ile Ala Phe Ala Trp Glu Pro Asn Gly Ser Lys
227 565 570 575

230 Phe Ala Val Leu His Gly Glu Ala Pro Arg Ile Ser Val Ser Phe Tyr
231 580 585 590

234 His Val Lys Asn Asn Gly Lys Ile Glu Leu Ile Lys Met Phe Asp Lys
235 595 600 605

238 Gln Gln Ala Asn Thr Ile Phe Trp Ser Pro Gln Gly Gln Phe Val Val
239 610 615 620

242 Leu Ala Gly Leu Arg Ser Met Asn Gly Ala Leu Ala Phe Val Asp Thr
243 625 630 635 640

246 Ser Asp Cys Thr Val Met Asn Ile Ala Glu His Tyr Met Ala Ser Asp
247 645 650 655

250 Val Glu Trp Asp Pro Thr Gly Arg Tyr Val Val Thr Ser Val Ser Trp
251 660 665 670

254 Trp Ser His Lys Val Asp Asn Ala Tyr Trp Leu Trp Thr Phe Gln Gly
255 675 680 685

258 Arg Leu Leu Gln Lys Asn Asn Lys Asp Arg Phe Cys Gln Leu Leu Trp
259 690 695 700

262 Arg Pro Arg Pro Pro Thr Leu Leu Ser Gln Glu Gln Ile Lys Gln Ile
263 705 710 715 720

266 Lys Lys Asp Leu Lys Lys Tyr Ser Lys Ile Phe Glu Gln Lys Asp Arg
267 725 730 735

270 Leu Ser Gln Ser Lys Ala Ser Lys Glu Leu Val Glu Arg Arg Arg Thr
271 740 745 750

274 Met Met Glu Asp Phe Arg Lys Tyr Arg Lys Met Ala Gln Glu Leu Tyr
275 755 760 765

278 Met Glu Gln Lys Asn Glu Arg Leu Glu Leu Arg Gly Gly Val Asp Thr
279 770 775 780

282 Asp Glu Leu Asp Ser Asn Val Asp Asp Trp Glu Glu Glu Thr Ile Glu
283 785 790 795 800

286 Phe Phe Val Thr Glu Glu Ile Ile Pro Leu Gly Asn Gln Glu
287 805 810

290 <210> SEQ ID NO: 5 

291 <211> LENGTH: 106 

292 <212> TYPE: PRT 

293 <213> ORGANISM: Artificial Sequence
W--> 295 <220> FEATURE: <221> p116

296 <222> LOCATION: [illegible]

297 <223> OTHER INFORMATION: corresponds to a portion (RRM) of eIF3 p116 subunit
299 <400> SEQUENCE: 5

301 Met Asp Arg Pro Gln Glu Ala Asp Gly Ile Asp Ser Val Ile Val Val
302 1 5 10 15

305 Asp Asn Val Pro Gln Val Gly Pro Asp Arg Leu Glu Lys Leu Lys Asn
306 20 25 30

309 Val Ile His Lys Ile Phe Ser Lys Phe Gly Lys Ile Thr Asn Asp Phe
310 35 40 45

313 Tyr Pro Glu Glu Asp Gly Lys Thr Lys Gly Tyr Ile Phe Leu Glu Tyr
314 50 55 60

317 Ala Ser Pro Ala His Ala Val Asp Ala Val Lys Asn Ala Asp Gly Tyr
318 65 70 75 80

321 Lys Leu Asp Lys Gln His Thr Phe Arg Val Asn Leu Phe Thr Asp Phe
322 85 90 95

325 Asp Lys Tyr Met Thr Ile Ser Asp Glu Trp
326 100 105

329 <210> SEQ ID NO: 6 

330 <211> LENGTH: 33 

331 <212> TYPE: DNA 

332 <213> ORGANISM: Artificial Sequence
W--> 334 <220> FEATURE: 

335 <222> LOCATION: 

336 <223> OTHER INFORMATION: HCV RRM 5' primer (RRMfwd) 
338 <400> SEQUENCE: 6 

340 catatggatc ggccccagga agcagatgga atc 33

343 <210> SEQ ID NO: 7 

344 <211> LENGTH: 33 

345 <212> TYPE: DNA 

346 <213> ORGANISM: Artificial Sequence 
W--> 348 <220> FEATURE: <221> primer_bind

349 <222> LOCATION: [illegible]

350 <223> OTHER INFORMATION: HCV RRM 3' primer (RRMrev)

352 <400> SEQUENCE: 7 

354 gtgctcgagc cactcgtcac tgatcgtcat ata 33 

357 <210> SEQ ID NO: 8 

358 <211> LENGTH: 29 

359 <212> TYPE: DNA 

360 <213> ORGANISM: Artificial Sequence
W--> 362 <220> FEATURE: <221> primer_bind

363 <222> LOCATION: [illegible]

364 <223> OTHER INFORMATION: HCV IRES 5' primer (IRESfwd) 

366 <400> SEQUENCE: 8 

368 accgctagcc tcccctgtga ggaactact 29 

371 <210> SEQ ID NO: 9 

372 <211> LENGTH: 46 

373 <212> TYPE: DNA 

374 <213> ORGANISM: Artificial Sequence

W--> 376 <220> FEATURE: <221> primer_bind

377 <222> LOCATION: [illegible]

378 <223> OTHER INFORMATION: HCV IRES 3' primer (IRESrev) 
380 <400> SEQUENCE: 9 

382 gaaagctttt ttctttgagg tttaggattt gtgctcatga tgcacg 46 

385 <210> SEQ ID NO: 10 

386 <211> LENGTH: 95 

387 <212> TYPE: DNA 

388 <213> ORGANISM: Artificial Sequence
W--> 390 <220> FEATURE: <221> primer_bind

391 <222> LOCATION: [illegible]

392 <223> OTHER INFORMATION: primer IIIabcfwd which corresponds to T7 polymerase promoter + 139-215 of
393 HCV (regions IIIa-IIIb)
395 <400> SEQUENCE: 10 

397 taatacgact cactataggg tagtggtctg cggaaccggt gagtacaccg gaattgccag 60 

399 gacgaccggg tcctttcttg gataaacccg ctcaa 95 

402 <210> SEQ ID NO: 11 

403 <211> LENGTH: 60 

404 <212> TYPE: DNA 

405 <213> ORGANISM: Artificial Sequence
W--> 407 <220> FEATURE: <221> primer_bind

408 <222> LOCATION: [illegible]

409 <223> OTHER INFORMATION: primer IIIabcrev which corresponds to 193-252 of HCV
(regions IIIb-IIIc)

413 <400> SEQUENCE: 11

415 tagcagtctc gcgggggcac gcccaaatct ccaggcattg agcgggttga tccaagaaag 60 

418 <210> SEQ ID NO: 12 

419 <211> LENGTH: 20 
420 <212> TYPE: DNA

421 <213> ORGANISM: Artificial Sequence

W--> 423 <220> FEATURE: <221> primer_bind

424 <222> LOCATION: 1 
Invalid Line Length;

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:10; Line(s) 392 
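
The 72-character rule quoted above can be checked mechanically before a listing is submitted. The following is a minimal sketch in Python, not the checker that produced this report; the function name `find_long_lines` and the sample text are assumptions made for illustration.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the report above; spaces count


def find_long_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            problems.append((number, len(line)))
    return problems


# Illustrative input: a short header line followed by an over-long <223> line.
sample = "\n".join([
    "<210> SEQ ID NO: 10",
    "<223> OTHER INFORMATION: primer IIIabcfwd which corresponds to "
    "T7 polymerase promoter + 139-215 of",
])
long_lines = find_long_lines(sample)
print(long_lines)
```

Line numbers here are counted from the start of the sample text, so they will not match the listing's own line numbers unless the whole file is passed in.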
L:11 M:270 C: Current Application Number differs. Replaced Current Application No

L:11 M:271 C: Current Filing Date differs. Replaced Current Filing Date

L:26 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 

L:50 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 

L:66 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 

L:80 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 



L:295 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:334 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:348 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:362 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:376 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:390 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:407 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:423 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:437 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:453 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:467 M:256 W: Invalid Numeric Header Field, <220> has non-blank data

L:481 M:256 W: Invalid Numeric Header Field, <220> has non-blank data
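
The M:256 warnings above appear to flag <220> lines that carry further data on the same line, such as a <221> entry. A minimal Python sketch of such a check follows. The regular expression and the function name `find_nonblank_220` are assumptions for illustration and are not taken from the software that produced this report; the pattern does not handle a leading W--> marker.

```python
import re

# Assumed line shape: an optional listing line number, the <220> tag,
# an optional "FEATURE:" label, then whatever else is on the line.
FEATURE_LINE = re.compile(r"^\s*(?:\d+\s+)?<220>\s*(?:FEATURE\s*:)?(.*)$")


def find_nonblank_220(lines):
    """Return 1-based indexes of <220> lines that carry extra data."""
    flagged = []
    for index, line in enumerate(lines, start=1):
        match = FEATURE_LINE.match(line)
        if match and match.group(1).strip():
            flagged.append(index)
    return flagged


# Illustrative input: only the second line should be flagged.
sample = [
    "<213> ORGANISM: Artificial Sequence",
    "<220> FEATURE: <221> primer_bind",
    "<220> FEATURE:",
    "<222> LOCATION: 1",
]
flagged = find_nonblank_220(sample)
print(flagged)
```

Moving the <221> entry onto its own line leaves the <220> line blank, which is the form the check accepts.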